Multi-valued network models are an important qualitative modelling approach used widely by the
biological community. In this paper we consider developing an abstraction theory for multi-valued
network models that allows the state space of a model to be reduced while preserving key properties
of the model. This is important as it aids the analysis and comparison of multi-valued networks
and, in particular, helps address the well-known problem of state space explosion associated with
such analysis. We also consider developing techniques for efficiently identifying abstractions and so
provide a basis for the automation of this task. We illustrate the theory and techniques developed by
investigating the identification of abstractions for two published MVN models of the lysis-lysogeny
switch in the bacteriophage λ.

1 Introduction 

In order to understand and analyse the complex control mechanisms inherent in biological systems a 
range of formal modelling techniques have been applied by biologists (for an overview see for example 
HITl). In particular, qualitative modelling techniques have emerged as an important modelling approach 
due to the lack of quantitative data on reaction rates and the noise associated with such data. Multi-valued 
networks (MVNs) [15, 18, 19] are a promising qualitative modelling approach for biological systems.
They extend the well-known Boolean network approach HJ SI by allowing the state of each regulatory 
entity to be within a range of discrete values instead of just on/off. In this way they are able to provide 
a compromise between the simplicity of Boolean networks and the more detailed differential equational 
models. 

However, the analysis of MVNs is not without problems. They suffer from the well-known state 
space explosion problem, a problem which is exacerbated in MVNs by the possibly large set of states 
associated with each individual entity. Another important shortcoming is the lack of any techniques for 
relating MVN models at different levels of abstraction. This hinders the comparison of MVN models 
and means there is no basis for the incremental development of MVNs. 

In this paper we begin to address these problems by developing an abstraction theory for MVNs. 
Abstraction techniques are a well established approach in the area of formal verification (see for example 
(HO) which allow a simpler model to be identified which can then be used to provide insight into 
the more complex original model. The abstraction theory we present is based on using an abstraction 
mapping to relate the reduced state space of an abstraction to the original MVN model. We develop a 
notion of what it means for one MVN to correctly abstract another and investigate the scope and limits of 
the analysis properties that can be inferred from an abstraction model. We show that abstractions allow 
sound analysis inferences about reachability properties in the sense that any reachability result shown on 
the abstraction must hold on the original model. Importantly, we show that all attractors of an abstraction 
correspond to attractors in the original model. 

We illustrate the theory and techniques developed by investigating the existence of abstractions for
two published MVN models for the genetic regulatory network controlling the lysis-lysogeny switch
in the bacteriophage λ [17, 5]. Bacteriophage λ HHO is a virus which, after infecting the bacterium
Escherichia coli, makes a decision to switch to one of two possible reproductive phases. It can enter the
lytic cycle, where the virus generates as many new viral particles as the infected cell's resources allow
and then lyses the cell wall to release the new phage. Alternatively, it can enter the lysogenic cycle, where
the λ DNA integrates into the host DNA, providing it with immunity from other phages and allowing
it to be replicated with each cell division. We consider a two and a four entity MVN model [17] of the
lysis-lysogeny switching mechanism and, using the techniques we have developed, identify corresponding
abstractions for these models.

This paper is organized as follows. In Section 2 we provide a brief overview of the MVN modelling 
approach and present a simple illustrative example. In Section 3 we develop an abstraction theory for 
multi-valued networks and present a range of results concerning this theory. In Section 4 we consider 
the identification of abstractions and develop a basis for automating the abstraction process. In Section 
5 we illustrate the theory and techniques developed by presenting two abstraction examples for
published models of the lysis-lysogeny switch in bacteriophage λ. Finally, in Section 6 we present some
concluding remarks and consider directions for future work. 

2 Multi-valued Network Models 

In this section, we introduce multi-valued networks (MVNs) [15, 18, 19], a qualitative modelling
approach which extends the well-known Boolean network [1] approach by allowing the state of each
regulatory entity to be within a range of discrete values. MVNs have been extensively studied in circuit design
(for example, see [15, 12]) and successfully applied to modelling biological systems (for example, see

An MVN consists of a set of logically linked entities $G = \{g_1, \ldots, g_k\}$ which regulate each other in
a positive or negative way. Each entity $g_i$ in an MVN has an associated set of discrete states
$Y(g_i) = \{0, \ldots, m_i\}$, for some $m_i \geq 1$, from which its current state is taken. Note that a Boolean network is
therefore simply an MVN in which each entity $g_i$ has a Boolean set of states $Y(g_i) = \{0, 1\}$. Each entity
$g_i$ also has a neighbourhood $N(g_i) = \{g_{i_1}, \ldots, g_{i_{l(i)}}\}$ which is the set of all entities that can directly affect
its state. (Note that $g_i$ may or may not be a member of $N(g_i)$.) Furthermore, interactions between one
entity and another only become functional if the state of the source entity has reached some threshold
state level (this threshold state level is always at least one). MVNs can therefore discriminate between
the strengths of different interactions, something which Boolean networks are unable to capture. The
behaviour of each entity $g_i$ based on these neighbourhood interactions is formally defined by a logical
next-state function $f_{g_i}$ which calculates the next state of $g_i$ given the current states of the entities in its
neighbourhood.

We can now define an MVN more formally as follows. 

Definition 1. An MVN $MV$ is a four-tuple $MV = (G, Y, N, F)$ where:

i) $G = \{g_1, \ldots, g_k\}$ is a non-empty, finite set of entities;

ii) $Y = (Y(g_1), \ldots, Y(g_k))$ is a tuple of state sets, where each $Y(g_i) = \{0, \ldots, m_i\}$, for some $m_i \geq 1$, is the
state space for entity $g_i$;

iii) $N = (N(g_1), \ldots, N(g_k))$ is a tuple of neighbourhoods, such that $N(g_i) \subseteq G$ is the neighbourhood of
$g_i$; and

iv) $F = (f_{g_1}, \ldots, f_{g_k})$ is a tuple of next-state multi-valued functions, such that if $N(g_i) = \{g_{i_1}, \ldots, g_{i_n}\}$
then the function $f_{g_i} : Y(g_{i_1}) \times \cdots \times Y(g_{i_n}) \to Y(g_i)$ defines the next state of $g_i$. □

In the sequel, let $MV = (G, Y, N, F)$ be an arbitrary MVN. In a slight abuse of notation we let $g_i \in MV$
represent that $g_i \in G$ is an entity in $MV$.

As an example, consider the MVN Ex1 defined in Figure 1, which consists of two entities $g_1$ and $g_2$
such that $Y(g_1) = \{0, 1\}$ and $Y(g_2) = \{0, 1, 2\}$. The update functions for each entity are defined using
state transition tables (see Figure 1(b)), where $[g_i]$ is used to denote the next state of an entity $g_i$. It can
be seen that entity $g_1$ inhibits $g_2$ and that entity $g_2$ inhibits $g_1$ but only when it reaches state 2 (this is
represented in Figure 1(a) by labelling the corresponding edge with a 2). Note that although $g_2 \in N(g_2)$
we have not drawn an edge for this in Figure 1(a), since $g_2$ has no regulatory effect on itself and is needed
simply to allow the effect of $g_1$ to be precisely defined.

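(a) [Network diagram not reproduced: g1 inhibits g2; g2 inhibits g1 via an edge labelled 2.]

(b)   g2 | [g1]          g1  g2 | [g2]
      ---+-----          -------+-----
       0 |  1             0   0 |  1
       1 |  1             0   1 |  2
       2 |  0             0   2 |  2
                          1   0 |  0
                          1   1 |  0
                          1   2 |  1

Figure 1: An example MVN Ex1 which consists of two entities $g_1$ and $g_2$, including: (a) network
structure; and (b) the state transition tables representing the corresponding next-state functions.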

A global state of an MVN $MV$ with $k$ entities is represented by a tuple of states $(s_1, \ldots, s_k)$, where
$s_i \in Y(g_i)$ represents the state of entity $g_i \in MV$. Note that as a notational convenience we often use $s_1 \ldots s_k$ to
represent a global state $(s_1, \ldots, s_k)$. When the current state of an MVN is clear from the context we allow
$g_i$ to denote both the name of an entity and its corresponding current state. The state space of an MVN
$MV$, denoted $S_{MV}$, is therefore the set of all possible global states $S_{MV} = Y(g_1) \times \cdots \times Y(g_k)$. The state of
an MVN can be updated either synchronously, where the state of all entities is updated simultaneously in
a single update step, or asynchronously, where entities update their state independently (see [9]). In the
following we focus on the synchronous update semantics since this has received considerable attention
from the biological community. Given two states $S_1, S_2 \in S_{MV}$, let $S_1 \to S_2$ represent a synchronous
update step such that $S_2$ is the state that results from simultaneously updating the state of each entity $g_i$
using its associated update function $f_{g_i}$ and the appropriate neighbourhood of states from $S_1$.

As an example, consider the global state 01 for Ex1 (see Figure 1) in which $g_1$ has state 0 and $g_2$
has state 1. Then $01 \to 12$ is a single synchronous update step on this state resulting in the new state 12.
The sequence of global states through $S_{MV}$ from some initial state is called a trace. Note that in the case
of a synchronous update semantics such traces are infinite. However, given that the global state space
is finite, this implies that a trace must eventually enter a cycle, known formally as an attractor cycle
[17, 19]. We make use of this fact to define a finite canonical representation for traces which specifies a
trace up to the first repeated state.
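
A synchronous update step can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the dictionaries `F_G1` and `F_G2` are our own encoding of the next-state functions of Ex1, read off from the transitions listed in Figure 2, and the name `sync_step` is hypothetical.

```python
# Next-state tables for Ex1 (assumed encoding, read off from the traces in Figure 2).
# F_G1 maps the current state of g2 to the next state of g1.
F_G1 = {0: 1, 1: 1, 2: 0}
# F_G2 maps the current states (g1, g2) to the next state of g2.
F_G2 = {(0, 0): 1, (0, 1): 2, (0, 2): 2,
        (1, 0): 0, (1, 1): 0, (1, 2): 1}


def sync_step(state):
    """Apply one synchronous update: every entity reads the same current state."""
    g1, g2 = state
    return (F_G1[g2], F_G2[(g1, g2)])


print(sync_step((0, 1)))  # (1, 2), i.e. the step 01 -> 12
```

Both entities are updated from the same snapshot of the current state, which is what distinguishes the synchronous semantics from an asynchronous one.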

Definition 2. Let $S_0 \in S_{MV}$ be a global state for $MV$. A trace is a list of global states
$\sigma(S_0) = \langle S_0, S_1, \ldots, S_n \rangle$ such that:

i) $S_i \to S_{i+1}$, for $0 \leq i < n$;

ii) $S_0, \ldots, S_{n-1}$ are unique states; and

iii) $S_n = S_i$, for some $i \in \{0, \ldots, n-1\}$. □

The set of all traces $Tr(MV) = \{\sigma(S) \mid S \in S_{MV}\}$ therefore completely characterizes the behaviour of
an MVN model under the synchronous semantics and is referred to as the trace semantics of $MV$.

In our running example, Ex1 has a state space of size $|S_{Ex1}| = 6$ and so (under a synchronous update
semantics) $Tr(Ex1)$ consists of the six traces presented in Figure 2(a) below.

(a)   $\sigma(00) = \langle 00, 11, 10, 10 \rangle$      $\sigma(10) = \langle 10, 10 \rangle$
      $\sigma(01) = \langle 01, 12, 01 \rangle$          $\sigma(11) = \langle 11, 10, 10 \rangle$
      $\sigma(02) = \langle 02, 02 \rangle$              $\sigma(12) = \langle 12, 01, 12 \rangle$

(b)   [State transition graph not reproduced.]

Figure 2: The trace semantics for Ex1: (a) the set of formal traces; and (b) a graphical representation of
the traces.
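
The canonical traces of Definition 2 can be generated mechanically. The sketch below is illustrative only; it assumes the same dictionary encoding of Ex1's next-state functions (read off from Figure 2), and the helper names `sync_step`, `trace` and `TR_EX1` are hypothetical.

```python
from itertools import product

# Assumed encoding of Ex1's next-state functions (read off from Figure 2).
F_G1 = {0: 1, 1: 1, 2: 0}
F_G2 = {(0, 0): 1, (0, 1): 2, (0, 2): 2,
        (1, 0): 0, (1, 1): 0, (1, 2): 1}


def sync_step(state):
    """One synchronous update of the global state (g1, g2)."""
    g1, g2 = state
    return (F_G1[g2], F_G2[(g1, g2)])


def trace(start):
    """Canonical trace (Definition 2): follow updates up to the first repeated state."""
    states = [start]
    while True:
        nxt = sync_step(states[-1])
        repeated = nxt in states
        states.append(nxt)
        if repeated:
            return tuple(states)


# One trace per global state in Y(g1) x Y(g2).
TR_EX1 = {s: trace(s) for s in product(range(2), range(3))}
for start, tr in sorted(TR_EX1.items()):
    print(start, "->", tr)
```

Because the state space is finite, the loop always terminates after at most $|S_{MV}|$ steps.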

As mentioned above, each trace leads to a cyclic sequence of states known as an attractor cycle
[17, 19]. For example, in Figure 2(b) we can see that Ex1 has three attractors: $10 \to 10$ and $02 \to 02$,
known as point attractors; and $01 \to 12 \to 01$, which is an attractor cycle of period 2 [17].

Given a trace $\sigma = \langle S_1, \ldots, S_n \rangle \in Tr(MV)$ for an MVN $MV$ we let $att(\sigma)$ denote the attractor cycle
that must occur in trace $\sigma$, i.e. $att(\sigma) = \langle S_k, S_{k+1}, \ldots, S_n \rangle$, for some $1 \leq k < n$ and $S_k = S_n$. We let
$ATT(MV)$ denote the set of all attractors for $MV$, i.e.

$ATT(MV) = \{att(\sigma) \mid \sigma \in Tr(MV)\}$.
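
As a small illustration, the attractors of Ex1 can be extracted from the traces listed in Figure 2(a). The names `att`, `rotate_to_min` and `ATT_EX1` are hypothetical, and rotating each cycle to start at its smallest state is our own normalisation (an assumption, so that a cycle reached from different states is counted once).

```python
# The six canonical traces of Ex1, copied from Figure 2(a).
TR_EX1 = [
    ((0, 0), (1, 1), (1, 0), (1, 0)),
    ((0, 1), (1, 2), (0, 1)),
    ((0, 2), (0, 2)),
    ((1, 0), (1, 0)),
    ((1, 1), (1, 0), (1, 0)),
    ((1, 2), (0, 1), (1, 2)),
]


def att(tr):
    """Suffix of a canonical trace starting at the first occurrence of its last state."""
    return tr[tr.index(tr[-1]):]


def rotate_to_min(cycle):
    """Assumed normalisation: write every cycle starting from its smallest state."""
    body = cycle[:-1]
    i = body.index(min(body))
    body = body[i:] + body[:i]
    return body + body[:1]


ATT_EX1 = {rotate_to_min(att(tr)) for tr in TR_EX1}
print(len(ATT_EX1))  # 3
```

The result contains the two point attractors and the period-2 cycle described in the text.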

Attractor cycles are very important biologically, where they are seen as representing different biological
states or functions (e.g. different cellular types such as proliferation, apoptosis and differentiation
[10]). Thus, the identification and analysis of attractor cycles for MVNs is an important subject which
has warranted much attention in the literature (for example, see [17, 19, 8]).



3 An Abstraction Theory for MVNs 

In this section we develop a notion of abstraction for MVNs by considering what it means for one MVN 
to abstractly implement the behaviour of another. This is based around the idea of showing that the 
trace semantics of one MVN is consistent with the trace semantics of a more complex MVN under an 
appropriate mapping of states. 

We begin by defining how an entity's state space can be simplified using a mapping to merge states.

Definition 3. Let $MV$ be an MVN and let $g_i \in MV$ be an entity such that $Y(g_i) = \{0, \ldots, m\}$ for some
$m > 1$. Then a state mapping $\phi(g_i)$ for entity $g_i$ is a surjective mapping $\phi(g_i) : \{0, \ldots, m\} \to \{0, \ldots, n\}$,
where $0 < n < m$. □

The idea is that a state mapping reduces the set of states an entity can be in by merging appropriate
states. The state mapping must be surjective to ensure that all states in the new reduced state space are
used. Note that we only consider state mappings with a codomain larger than one, since a single-state entity
does not appear to be of biological interest.

As an example, consider entity $g_2 \in Ex1$ (see Figure 1), which has the state space $Y(g_2) = \{0, 1, 2\}$. It
is only meaningful to simplify $g_2 \in Ex1$ to a Boolean entity and so one possible state mapping to achieve
this would be:

    $\phi(g_2) = \{0 \mapsto 0,\ 1 \mapsto 0,\ 2 \mapsto 1\}$

which merges states 0 and 1 into a single state 0, and translates state 2 into 1.

Clearly, there are a number of different possible state mappings which can be applied to reduce a
node's state space from $m$ to $n$ states, for $1 < n < m$. The complete set of all such state mappings is
denoted $MS(m, n) = \{\phi \mid \phi : \{0, \ldots, m-1\} \to \{0, \ldots, n-1\} \text{ and } \phi \text{ is surjective}\}$. For example, the
mapping set $MS(3, 2)$ consists of the following six mappings:

    (1) $\{0 \mapsto 0,\ 1 \mapsto 0,\ 2 \mapsto 1\}$      (4) $\{0 \mapsto 1,\ 1 \mapsto 1,\ 2 \mapsto 0\}$
    (2) $\{0 \mapsto 0,\ 1 \mapsto 1,\ 2 \mapsto 1\}$      (5) $\{0 \mapsto 1,\ 1 \mapsto 0,\ 2 \mapsto 0\}$
    (3) $\{0 \mapsto 0,\ 1 \mapsto 1,\ 2 \mapsto 0\}$      (6) $\{0 \mapsto 1,\ 1 \mapsto 0,\ 2 \mapsto 1\}$

In order to be able to consider simplifying several entities at the same time during the abstraction
process we introduce the notion of a family of state mappings as follows.

Definition 4. Let $MV = (G, Y, N, F)$ be an MVN with entities $G = \{g_1, \ldots, g_k\}$. Then an abstraction
mapping $\phi$ for $MV$ is a family of mappings $\phi = \langle \phi(g_1), \ldots, \phi(g_k) \rangle$ such that for each $1 \leq i \leq k$ we have
$\phi(g_i)$ is either a state mapping for entity $g_i$ or is the identity mapping $I_{g_i} : Y(g_i) \to Y(g_i)$ where $I_{g_i}(s) = s$,
for all $s \in Y(g_i)$. Furthermore, for $\phi$ to be well-defined we insist that at least one of the mappings $\phi(g_i)$
is a state mapping. □

Note that in the sequel, given a state mapping $\phi(g_i)$, we let it denote both itself and the corresponding
abstraction mapping containing only the single state mapping $\phi(g_i)$.

An abstraction mapping can be lifted and applied to the trace semantics of an MVN as follows.

Definition 5. An abstraction mapping $\phi = \langle \phi(g_1), \ldots, \phi(g_k) \rangle$ for $MV$ can be used to abstract a global
state $s_1 \ldots s_k \in S_{MV}$ by applying it pointwise, i.e. $\phi(s_1 \ldots s_k) = \phi(g_1)(s_1) \ldots \phi(g_k)(s_k)$. We can lift an
abstraction mapping $\phi$ to a trace $\sigma(S_0) = \langle S_0, \ldots, S_n \rangle \in Tr(MV)$ by applying $\phi$ to each global state in the
trace as follows:

    $\phi(\sigma(S_0)) = \langle \phi(S_0), \ldots, \phi(S_n) \rangle$.

However, $\phi(\sigma(S_0))$ may contain contradictory steps and thus not represent a meaningful abstracted trace.
We say an abstracted trace $\phi(\sigma(S_0))$ is valid iff there do not exist two identical states $\phi(S_i) = \phi(S_j)$,
for some $i, j \in \{0, \ldots, n-1\}$, such that $\phi(S_{i+1}) \neq \phi(S_{j+1})$. If $\phi(\sigma(S_0))$ is a valid abstracted trace then
we need to ensure it is in the canonical form introduced in Definition 2. We do this by removing any
repeating tail that may have been introduced by the abstraction mapping, i.e. choose the smallest $k$,
$0 < k \leq n$, such that $\phi(S_0), \ldots, \phi(S_{k-1})$ are unique states and $\phi(S_i) = \phi(S_k)$, for some $i \in \{0, \ldots, k-1\}$.
(Note that whenever we talk about a valid abstracted trace we will assume it is in its canonical form.)
We can lift $\phi$ to the trace semantics of a model $MV$:

    $\phi(Tr(MV)) = \{\phi(\sigma(S)) \mid \sigma(S) \in Tr(MV) \text{ and } \phi(\sigma(S)) \text{ is valid}\}$.

□
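
The mapping set $MS(m, n)$ introduced earlier in this section is small enough to enumerate directly. The following sketch (the function name `state_mappings` is hypothetical) lists every surjection from $\{0, \ldots, m-1\}$ onto $\{0, \ldots, n-1\}$ by filtering all possible images.

```python
from itertools import product


def state_mappings(m, n):
    """Enumerate MS(m, n): all surjections from {0..m-1} onto {0..n-1}."""
    targets = set(range(n))
    return [dict(enumerate(image))
            for image in product(range(n), repeat=m)
            if set(image) == targets]


MS_3_2 = state_mappings(3, 2)
print(len(MS_3_2))  # 6
```

Brute-force filtering is adequate here because the number of states per entity is small in practice; it is not intended as an efficient enumeration.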

Continuing with our running example, $\phi(g_2)$ can be applied as an abstraction mapping to the trace
semantics $Tr(Ex1)$ (see Figure 2), resulting in the abstracted trace semantics $\phi(g_2)(Tr(Ex1))$ shown
below in Figure 3, in which the states of $g_2$ have been reduced accordingly.

    $\phi(g_2)(\sigma(00)) = \langle 00, 10, 10 \rangle$      $\phi(g_2)(\sigma(10)) = \langle 10, 10 \rangle$
    $\phi(g_2)(\sigma(01)) = \langle 00, 11, 00 \rangle$      $\phi(g_2)(\sigma(11)) = \langle 10, 10 \rangle$
    $\phi(g_2)(\sigma(02)) = \langle 01, 01 \rangle$          $\phi(g_2)(\sigma(12)) = \langle 11, 00, 11 \rangle$

Figure 3: The trace semantics $\phi(g_2)(Tr(Ex1))$ resulting from abstracting $Tr(Ex1)$ using $\phi(g_2)$.

Note that $\phi(g_2)(Tr(Ex1))$ is non-deterministic in the sense that we have two different traces beginning
with the same state 00 (i.e. starting in state 00 we have a non-deterministic choice between two
abstracted traces, $\langle 00, 10, 10 \rangle$ and $\langle 00, 11, 00 \rangle$). This occurs as we are viewing the more complex set of
behaviours captured by $Tr(Ex1)$ from a simpler perspective.

To illustrate how invalid abstracted traces arise, consider an MVN with two entities that has the
following trace $\sigma(00) = \langle 00, 11, 01, 02, 02 \rangle$. When $\sigma(00)$ is abstracted with the abstraction
mapping $\phi(g_2) = \{0 \mapsto 0,\ 1 \mapsto 0,\ 2 \mapsto 1\}$ the result is the following:

    $\phi(g_2)(\sigma(00)) = \langle 00, 10, 00, 01, 01 \rangle$.

However, it can be observed that this is not a valid trace according to Definition 5 because global state
00 can lead to two different states (10 and 01), and it will therefore be omitted from the abstracted trace semantics.
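
The lifting of an abstraction mapping to traces, including the validity check and the reduction to canonical form from Definition 5, can be sketched as follows. This is an illustrative sketch only: the names `phi`, `abstract_trace`, `ABSTRACTED` and `INVALID` are hypothetical, and returning `None` for an invalid abstracted trace is our own convention.

```python
PHI_G2 = {0: 0, 1: 0, 2: 1}  # the state mapping phi(g2) used in the text


def phi(state):
    """Apply the abstraction mapping pointwise; g1 is left unchanged (identity)."""
    g1, g2 = state
    return (g1, PHI_G2[g2])


def abstract_trace(tr):
    """Return the canonical abstracted trace, or None if it is not valid."""
    mapped = [phi(s) for s in tr]
    successor = {}
    for cur, nxt in zip(mapped, mapped[1:]):
        # Validity: an abstract state must never have two different successors.
        if successor.setdefault(cur, nxt) != nxt:
            return None
    canonical = []
    for s in mapped:
        repeated = s in canonical
        canonical.append(s)
        if repeated:  # cut the trace at the first repeated abstract state
            break
    return tuple(canonical)


# The six canonical traces of Ex1, copied from Figure 2(a).
TR_EX1 = [
    ((0, 0), (1, 1), (1, 0), (1, 0)),
    ((0, 1), (1, 2), (0, 1)),
    ((0, 2), (0, 2)),
    ((1, 0), (1, 0)),
    ((1, 1), (1, 0), (1, 0)),
    ((1, 2), (0, 1), (1, 2)),
]
ABSTRACTED = {abstract_trace(tr) for tr in TR_EX1}

# The invalid example from the text: abstract state 00 leads to both 10 and 01.
INVALID = abstract_trace(((0, 0), (1, 1), (0, 1), (0, 2), (0, 2)))
print(INVALID)  # None
```

Collecting the results in a set reflects that $\sigma(10)$ and $\sigma(11)$ abstract to the same trace, so `ABSTRACTED` holds the five distinct traces of Figure 3.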
We are now ready to define what it means for one MVN to be an abstraction of another. 

Definition 6. Let $MV_1 = (G_1, Y_1, N_1, F_1)$ and $MV_2 = (G_2, Y_2, N_2, F_2)$ be two MVNs with the same
structure, i.e. $G_1 = G_2$ and $N_1(g_i) = N_2(g_i)$, for all $g_i \in MV_1$. Let $\phi$ be an abstraction mapping from
$MV_2$ to $MV_1$. Then we say that $MV_1$ abstracts $MV_2$ under $\phi$, denoted $MV_1 \lhd_{\phi} MV_2$, if, and only if,
$Tr(MV_1) \subseteq \phi(Tr(MV_2))$. □

An abstraction $MV_1 \lhd_{\phi} MV_2$ indicates that the model $MV_1$ consistently abstracts the behaviour of
a more complex model $MV_2$ by reducing the state space of those entities identified in the abstraction
mapping $\phi$. Note that, alternatively, we could consider $MV_2$ to be a refinement of $MV_1$ in the sense that $MV_2$
consistently extends $MV_1$ with the addition of further states. Such a notion of refinement is useful as it
provides a framework for the incremental development of MVN models.

As an abstraction example, consider the MVN Ex2 defined in Figure 4, which has the same structure
as Ex1 (see Figure 1) but is a Boolean model. Then clearly, given the abstraction mapping $\phi(g_2)$
introduced earlier, we can see that $Tr(Ex2) \subseteq \phi(g_2)(Tr(Ex1))$ holds and so Ex2 is an abstraction of Ex1, i.e.
$Ex2 \lhd_{\phi(g_2)} Ex1$ holds.

      g2 | [g1]          g1  g2 | [g2]          $\sigma(00) = \langle 00, 11, 00 \rangle$
      ---+-----          -------+-----          $\sigma(01) = \langle 01, 01 \rangle$
       0 |  1             0   0 |  1            $\sigma(10) = \langle 10, 10 \rangle$
       1 |  0             0   1 |  1            $\sigma(11) = \langle 11, 00, 11 \rangle$
                          1   0 |  0
                          1   1 |  0

Figure 4: State transition tables defining Ex2 and its associated trace semantics $Tr(Ex2)$.
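
Once both trace sets are available, the check in Definition 6 is a plain set inclusion. The sketch below copies the traces from Figures 3 and 4; the names `PHI_TR_EX1`, `TR_EX2` and `abstracts` are hypothetical.

```python
# Abstracted trace semantics phi(g2)(Tr(Ex1)), copied from Figure 3.
PHI_TR_EX1 = {
    ((0, 0), (1, 0), (1, 0)),
    ((0, 0), (1, 1), (0, 0)),
    ((0, 1), (0, 1)),
    ((1, 0), (1, 0)),
    ((1, 1), (0, 0), (1, 1)),
}

# Trace semantics Tr(Ex2), copied from Figure 4.
TR_EX2 = {
    ((0, 0), (1, 1), (0, 0)),
    ((0, 1), (0, 1)),
    ((1, 0), (1, 0)),
    ((1, 1), (0, 0), (1, 1)),
}


def abstracts(tr_abstract, phi_tr_original):
    """Definition 6 as a set inclusion on canonical traces."""
    return tr_abstract <= phi_tr_original


print(abstracts(TR_EX2, PHI_TR_EX1))  # True
# Abstracted behaviour of Ex1 that Ex2 does not reproduce.
print(PHI_TR_EX1 - TR_EX2)
```

The second line of output shows the one abstracted trace of Ex1, starting in state 00, that Ex2 does not capture.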

In special cases, an abstraction may exactly capture the behaviour of the original MVN model under
the given abstraction mapping. We distinguish this stronger case with the notion of an exact abstraction.

Definition 7. Let $MV_1$ and $MV_2$ be two MVNs such that $MV_1 \lhd_{\phi} MV_2$ for some abstraction
mapping $\phi$. Then we say that $MV_1$ exactly abstracts $MV_2$ under $\phi$, denoted $MV_1 =_{\phi} MV_2$, if, and only if,
$Tr(MV_1) = \phi(Tr(MV_2))$ and for every $\sigma \in Tr(MV_2)$, the abstracted trace $\phi(\sigma)$ is valid. □

Exact abstractions are interesting as they indicate redundant states (normally corresponding to
entity thresholds) which have no effect on the qualitative behaviour of an MVN. Consequently, an exact
abstraction provides a simpler representation of an MVN whilst preserving all its behaviour under the
given abstraction mapping.

It is natural to consider whether every (non-Boolean¹) MVN has an abstraction. In other words, do
there exist MVNs which contain regulatory interactions that are too subtle to be represented in a
simpler state domain? This is an interesting question since it provides insight into the need for non-Boolean
MVN models. Unsurprisingly, it turns out that abstractions do not always exist, as formalized in the
following theorem.



Theorem 8. Not every non-Boolean MVN has an abstraction. 



Proof. We simply construct a non-Boolean MVN which we show has no abstractions. Let Ex3 be
defined by extending Ex1 (Figure 1) with a third Boolean entity $g_3$ which is inhibited whenever $g_2$ is
in a state greater than or equal to 1. The complete definition for Ex3 is given in Figure 5. We can see
that $g_2 \in Ex3$ now acts in two subtly different ways: on one hand $g_1$ is inhibited when $g_2 = 2$; and on
the other hand, $g_3$ is inhibited when $g_2 \geq 1$. We can show that no abstraction exists for this model by
exhaustively considering each possible abstraction mapping $\phi(g_2)$ and showing that for every possible
candidate abstraction model $MV_A$ we have $Tr(MV_A) \not\subseteq \phi(g_2)(Tr(Ex3))$. □

Figure 5: The state transition tables defining Ex3 (used to prove Theorem 8). [Tables not reproduced.]



This is an important result which, although centered around the relationship assumption formalized 
by our abstraction theory, provides insight into the expressive power of MVNs and in particular, motivates 
the need for multi-valued modelling techniques. 

One of the main motivations for defining an abstraction theory is to allow simplified models of an 
MVN to be identified to aid the analysis process. This therefore raises the question of what properties of 
an abstraction are preserved by the original MVN and we end this section by considering this question. 



¹ An MVN is said to be non-Boolean if it contains at least one entity which has more than two possible states.

We begin by introducing a notion of corresponding states and traces.

Definition 9. Let $MV$ be an MVN with an abstraction $MV_A$ under a given abstraction mapping $\phi$,
i.e. $MV_A \lhd_{\phi} MV$. Let $S^A \in S_{MV_A}$ be some global state of abstraction $MV_A$ and $S \in S_{MV}$ be a global state
of the original model $MV$. Then we say that $S^A$ and $S$ correspond with respect to $\phi$, denoted $S^A \lhd_{\phi} S$,
if, and only if, $S^A = \phi(S)$. Furthermore, given traces $\sigma^A \in Tr(MV_A)$ and $\sigma \in Tr(MV)$ we say $\sigma^A$ and $\sigma$
correspond with respect to $\phi$, denoted $\sigma^A \lhd_{\phi} \sigma$, if, and only if, $\phi(\sigma)$ is valid and $\sigma^A = \phi(\sigma)$. □

Let $S_1 \to^{*} S_2$ denote the fact that global state $S_2 \in S_{MV}$ is reachable from global state $S_1 \in S_{MV}$ in
the model $MV$. We now clarify the relationship between reachability properties in an abstraction and its
corresponding original MVN model.

Theorem 10. Let $MV_A \lhd_{\phi} MV$ for some abstraction mapping $\phi$ and let $S_1^A, S_2^A \in S_{MV_A}$. If $S_1^A \to^{*} S_2^A$
in $MV_A$ then there must exist states $S_1, S_2 \in S_{MV}$ such that $S_1^A \lhd_{\phi} S_1$, $S_2^A \lhd_{\phi} S_2$, and $S_1 \to^{*} S_2$ in $MV$.

Proof. Since $S_1^A \to^{*} S_2^A$ there must exist a trace $\sigma(S_1^A) \in Tr(MV_A)$ containing $S_2^A$. From Definition 6
we know that $Tr(MV_A) \subseteq \phi(Tr(MV))$ must hold. Therefore there must exist a state $S_1 \in S_{MV}$ such that
$\sigma(S_1^A) \lhd_{\phi} \sigma(S_1)$, i.e. $\phi(\sigma(S_1)) = \sigma(S_1^A)$. From this it is straightforward to see that there must exist the
required state $S_2$ in trace $\sigma(S_1)$ such that $S_2^A \lhd_{\phi} S_2$ and $S_1 \to^{*} S_2$. □

In other words, reachability properties of abstractions have corresponding reachability properties in 
the original MVN. However, since abstractions normally capture less behaviour than the original model, 
there are limitations on what can be deduced from an abstraction. It turns out that determining reachability
in a model using an abstraction is a semi-decidable property: (i) By Theorem 10 we know that if one
state is reachable from another in an abstraction then a corresponding reachability property must hold 
in the original model; (ii) However, if one state is not reachable from another in an abstraction then a 
corresponding reachability property in the original MVN may or may not hold and more analysis will be 
required. 

The final result we present is important as it shows that the attractor cycles found in an abstraction 
are preserved by the original MVN. 

Theorem 11. Let $MV_A \lhd_{\phi} MV$ for some abstraction mapping $\phi$. Then

    $ATT(MV_A) \subseteq \phi(ATT(MV))$.

Proof. Let $\tau \in ATT(MV_A)$; then we need to show that $\tau \in \phi(ATT(MV))$. By definition we know there
must exist a trace $\sigma_A \in Tr(MV_A)$ such that $att(\sigma_A) = \tau$. Since $MV_A \lhd_{\phi} MV$ we know there must exist
a trace $\sigma \in Tr(MV)$ such that $\phi(\sigma)$ is valid and $\sigma_A = \phi(\sigma)$. It follows that $\tau = \phi(att(\sigma))$ and so by
definition we know that $\tau \in \phi(ATT(MV))$ as required. □
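
As a worked check of Theorem 11 on the running example (an illustration, not part of the proof), take $MV_A = Ex2$ and $MV = Ex1$ with the mapping $\phi(g_2)$. The attractors below are read off from the traces in Figures 2 and 4, with each cycle written once for every state that starts a trace on it:

```latex
\begin{align*}
ATT(Ex1) &= \{\langle 10,10\rangle,\ \langle 02,02\rangle,\ \langle 01,12,01\rangle,\ \langle 12,01,12\rangle\},\\
\phi(g_2)(ATT(Ex1)) &= \{\langle 10,10\rangle,\ \langle 01,01\rangle,\ \langle 00,11,00\rangle,\ \langle 11,00,11\rangle\},\\
ATT(Ex2) &= \{\langle 01,01\rangle,\ \langle 10,10\rangle,\ \langle 00,11,00\rangle,\ \langle 11,00,11\rangle\}.
\end{align*}
```

Hence $ATT(Ex2) \subseteq \phi(g_2)(ATT(Ex1))$, as the theorem requires; in this particular example the two sets happen to coincide.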

4 Identifying Model Abstractions 

In the previous section we defined a formal notion of what it means for one MVN to be a correct
abstraction of another. Given an MVN $MV$ and an abstraction mapping $\phi$ we can therefore define the set
$AS(MV, \phi)$ of all abstractions of $MV$ under $\phi$, i.e.

    $AS(MV, \phi) = \{MV_A \mid MV_A \lhd_{\phi} MV\}$.



R. Banks & L. J. Steggles 






Finding abstractions, i.e. members of $AS(MV, \phi)$, is clearly an important task given that they provide a
means of simplifying the analysis of a model and can help address the well-known problem of state space
explosion. However, in practice, the brute force derivation of this abstraction set becomes intractable for
all but the smallest MVN. Specifically, if we have $k$ entities each with $n$ states, then we have a worst case
upper bound of $(n^{n^k})^k$ possible candidate models to consider for any abstraction mapping. For instance,
there are $(2^{2^3})^3 = 16777216$ possible Boolean networks consisting of just three entities! The rest of this
section considers techniques for efficiently identifying abstractions and provides a basis for automating
this task. Initial ideas for implementing these techniques are presented in [2].
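
The worst-case bound is easy to evaluate numerically. The sketch below assumes, as the bound does, that every entity's neighbourhood contains all $k$ entities, so that each next-state function has $n^k$ input rows and $n$ possible outputs per row; the function name `candidate_count` is hypothetical.

```python
def candidate_count(n, k):
    """Worst-case number of candidate models: k functions, each one of n**(n**k)."""
    functions_per_entity = n ** (n ** k)
    return functions_per_entity ** k


print(candidate_count(2, 3))  # 16777216 Boolean networks on three entities
print(candidate_count(2, 4))  # already 2**64 for four Boolean entities
```

The doubly exponential growth in $k$ is what makes brute-force enumeration impractical even for small networks.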

We begin by considering how an abstraction mapping can be applied to an MVN to produce a set of 
potential abstraction models. 



Definition 12. Let $\phi = \langle \phi(g_1), \ldots, \phi(g_k) \rangle$ be an abstraction mapping for an MVN $MV$. For each
entity $g_i \in MV$ we can abstract the next-state function $f_{g_i} : Y(g_{i_1}) \times \cdots \times Y(g_{i_n}) \to Y(g_i)$ to a (possibly)
non-deterministic next-state function

    $\phi(f_{g_i}) : \phi(g_{i_1})(Y(g_{i_1})) \times \cdots \times \phi(g_{i_n})(Y(g_{i_n})) \to \mathcal{P}(\phi(g_i)(Y(g_i)))$

by applying $\phi$ to its definition in the obvious way. We say that $MV_A$ results from applying $\phi$ to $MV$ iff:

(1) $MV_A$ has the same entities and neighbourhood structure as $MV$;

(2) The state space of each entity $g_i \in MV_A$ is the set $\phi(g_i)(Y(g_i))$;

(3) For each $g_i \in MV_A$ its next-state function $f^A_{g_i} : \phi(g_{i_1})(Y(g_{i_1})) \times \cdots \times \phi(g_{i_n})(Y(g_{i_n})) \to \phi(g_i)(Y(g_i))$
is a deterministic restriction of $\phi(f_{g_i})$.

We define $\phi(MV)$ to be the set of all such MVNs, i.e.

    $\phi(MV) = \{MV_A \mid MV_A \text{ results from applying } \phi \text{ to } MV\}$.

The trace semantics of $\phi(MV)$ is then defined by $Tr(\phi(MV)) = \bigcup_{MV_A \in \phi(MV)} Tr(MV_A)$. □



To illustrate this idea, consider applying the abstraction mapping $\phi(g_2) = \{0 \mapsto 0,\ 1 \mapsto 0,\ 2 \mapsto 1\}$
to the example MVN Ex1 introduced in Section 2 (see Figure 1). The resulting abstracted next-state
functions are presented in Figure 6. The set $\phi(g_2)(Ex1)$ will contain two candidate abstractions in which
the state space for $g_2$ is reduced to $\{0, 1\}$ and whose next-state functions are given by the two possible
interpretations (highlighted in bold) for the abstracted state transition table for $g_2$ given in Figure 6.

Figure 6: The (non-deterministic) state transition tables for $\phi(g_2)(Ex1)$ which result from applying
$\phi(g_2)$ to the state transition tables of Ex1 (Figure 1).
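
Applying an abstraction mapping to the next-state tables, and then enumerating the deterministic restrictions, can be sketched as below. This is an illustration under stated assumptions: the dictionaries encoding Ex1 are read off from the traces in Figure 2, a non-deterministic table is represented as a map from abstract inputs to sets of abstract outputs, and the names `ABS_F1`, `ABS_F2`, `restrictions` and `CANDIDATES` are hypothetical.

```python
from itertools import product

# Assumed encoding of Ex1's next-state functions (read off from Figure 2).
F_G1 = {0: 1, 1: 1, 2: 0}
F_G2 = {(0, 0): 1, (0, 1): 2, (0, 2): 2,
        (1, 0): 0, (1, 1): 0, (1, 2): 1}
PHI_G2 = {0: 0, 1: 0, 2: 1}  # phi(g2); g1 keeps the identity mapping

# phi(f_g1): abstract the input g2; the output g1 is unchanged.
ABS_F1 = {}
for g2, nxt in F_G1.items():
    ABS_F1.setdefault(PHI_G2[g2], set()).add(nxt)

# phi(f_g2): abstract both the g2 input and the g2 output.
ABS_F2 = {}
for (g1, g2), nxt in F_G2.items():
    ABS_F2.setdefault((g1, PHI_G2[g2]), set()).add(PHI_G2[nxt])


def restrictions(table):
    """All deterministic tables obtained by picking one output per input row."""
    keys = sorted(table)
    options = [sorted(table[key]) for key in keys]
    return [dict(zip(keys, choice)) for choice in product(*options)]


CANDIDATES = [(f1, f2)
              for f1 in restrictions(ABS_F1)
              for f2 in restrictions(ABS_F2)]
print(ABS_F2[(0, 0)])   # {0, 1}: the only non-deterministic row
print(len(CANDIDATES))  # 2
```

The number of candidates is the product of the sizes of the output sets over all rows, which is typically far smaller than the worst-case bound on arbitrary models.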

An interesting observation arises by noting that for a given model $MV$ and abstraction mapping $\phi$,
the trace semantics of the abstracted MVN $Tr(\phi(MV))$ is not in general the same as the abstracted trace
semantics $\phi(Tr(MV))$. In fact, it turns out that an important relationship exists between the two, in that
$Tr(\phi(MV))$ will always contain at least the traces of $\phi(Tr(MV))$, as shown by the following theorem.

Theorem 13. Let $\phi = \langle \phi(g_1), \ldots, \phi(g_k) \rangle$ be an abstraction mapping for $MV$. Then we have

    $\phi(Tr(MV)) \subseteq Tr(\phi(MV))$.

Proof. Let $\sigma = \langle S_0, \ldots, S_n \rangle \in Tr(MV)$ be an arbitrary trace; then we need to show that if $\phi(\sigma)$ is a valid
abstracted trace then $\phi(\sigma) \in Tr(\phi(MV))$. Let $S_i \to S_{i+1}$ be an arbitrary state step in $\sigma$. Assuming $MV$ has
$k$ entities then this state step can be broken up into $k$ components $S_i \to S_{i+1}^{j}$, for $j = 1, \ldots, k$, where
$S_{i+1}^{j}$ is the $j$-th component of $S_{i+1}$. Applying
the abstraction mapping to each component gives $\phi(S_i) \to \phi(g_j)(S_{i+1}^{j})$. Clearly, by Definition 12 there
must exist $MV_A \in \phi(MV)$ whose next-state functions reproduce each of these abstracted component steps
and so is able to reproduce the complete abstracted state step $\phi(S_i) \to \phi(S_{i+1})$. Since $\phi(\sigma)$ is a valid
abstracted trace it follows that we must be able to find $MV_A \in \phi(MV)$ which is able to reproduce all the
abstracted state steps $\phi(S_i) \to \phi(S_{i+1})$, for $i = 0, \ldots, n-1$. Thus, we know $\phi(\sigma) \in Tr(MV_A)$ and so by
Definition 12 we have $\phi(\sigma) \in Tr(\phi(MV))$ as required. □

From this result, it follows that any abstraction of an MVN $MV$ must be contained within the set of
potential abstractions $\phi(MV)$, as formalized in the corollary below.

Corollary 14. Given two MVNs $MV_1$ and $MV_2$ we have that

    $MV_1 \lhd_{\phi} MV_2 \implies MV_1 \in \phi(MV_2)$.

Proof. By Definition 6 we know $Tr(MV_1) \subseteq \phi(Tr(MV_2))$ and so by Theorem 13 we have
$Tr(MV_1) \subseteq Tr(\phi(MV_2))$. It therefore follows by Definition 12 that $MV_1 \in \phi(MV_2)$. □

Corollary 14 provides an important necessary condition for an MVN to be an abstraction of another
for a given abstraction mapping. It gives us a way of restricting the models that need to be considered 
when iterating through possible candidate abstractions for an MVN; we simply apply the abstraction 
mapping to the MVN in question and then consider each possible deterministic model that results from 
this application. This observation results in an exponentially smaller search space and provides the basis 
for a more efficient abstraction finding algorithm. 

To illustrate the above ideas let us consider finding all the abstractions for Ex1 under $\phi(g_2)$, i.e.
calculating the abstraction set $AS(Ex1, \phi(g_2))$. Using the results from Corollary 14 we begin by abstracting
the state transition tables for Ex1 using the given abstraction mapping (shown previously in Figure 6)
and identifying the potential abstractions contained in $\phi(g_2)(Ex1)$. We can see that the behaviour of $g_2$
is non-deterministic when $g_1 = 0$ and $g_2 = 0$. As such, we have just two possible candidate models $AB_1$
and $AB_2$ to consider, shown respectively by Figure 7(a) and Figure 7(b) (where the rules highlighted in
bold are the only ones that differ).

In order to verify whether $AB_1$ and $AB_2$ are indeed abstractions according to our theory, we check
if their trace semantics are contained within $\phi(g_2)(Tr(Ex1))$. By considering Figure 3 and Figure 7 we
can observe that $AB_1$ is not an abstraction according to Definition 6, since $Tr(AB_1) \not\subseteq \phi(g_2)(Tr(Ex1))$;
in other words, its behaviour is not regarded as being consistent with Ex1. On the other hand, we find
that $AB_2$ is a correct abstraction as $Tr(AB_2) \subseteq \phi(g_2)(Tr(Ex1))$. (Indeed, we can see that $AB_2$ is precisely
the same MVN as Ex2, which was introduced as an abstraction in the previous section.) Thus, we have
shown that the abstraction set $AS(Ex1, \phi(g_2)) = \{AB_2\}$.

(a) Candidate model $AB_1$      (b) Candidate model $AB_2$

Figure 7: The state transition tables and trace semantics for candidate models $AB_1$ and $AB_2$.
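
The search just described can be sketched end to end. This is a minimal illustration rather than the authors' implementation: the encoding of Ex1 is read off from the traces in Figure 2, the abstracted tables are hard-coded as obtained by applying $\phi(g_2)$ to them, and all helper names (`trace`, `abstract_trace`, `ABSTRACTIONS`, and so on) are hypothetical.

```python
from itertools import product

# Assumed encoding of Ex1's next-state functions (read off from Figure 2).
F_G1 = {0: 1, 1: 1, 2: 0}
F_G2 = {(0, 0): 1, (0, 1): 2, (0, 2): 2,
        (1, 0): 0, (1, 1): 0, (1, 2): 1}
PHI_G2 = {0: 0, 1: 0, 2: 1}


def trace(start, step):
    """Canonical trace (Definition 2) under the given update function."""
    states = [start]
    while True:
        nxt = step(states[-1])
        repeated = nxt in states
        states.append(nxt)
        if repeated:
            return tuple(states)


def abstract_trace(tr):
    """Canonical abstracted trace under phi(g2), or None if it is not valid."""
    mapped = [(g1, PHI_G2[g2]) for g1, g2 in tr]
    successor = {}
    for cur, nxt in zip(mapped, mapped[1:]):
        if successor.setdefault(cur, nxt) != nxt:
            return None
    canonical = []
    for s in mapped:
        repeated = s in canonical
        canonical.append(s)
        if repeated:
            break
    return tuple(canonical)


def ex1_step(state):
    return (F_G1[state[1]], F_G2[state])


# phi(g2)(Tr(Ex1)): keep only the valid abstracted traces.
PHI_TRACES = set()
for s in product(range(2), range(3)):
    abstracted = abstract_trace(trace(s, ex1_step))
    if abstracted is not None:
        PHI_TRACES.add(abstracted)

# Abstracted table for g1 is deterministic; g2 has one free row, (g1, g2) = (0, 0).
ABS_F1 = {0: 1, 1: 0}
ABSTRACTIONS = []
for choice in (0, 1):  # choice 0 gives AB1, choice 1 gives AB2
    f2 = {(0, 0): choice, (0, 1): 1, (1, 0): 0, (1, 1): 0}
    step = lambda state, f2=f2: (ABS_F1[state[1]], f2[state])
    candidate_traces = {trace(s, step) for s in product(range(2), repeat=2)}
    if candidate_traces <= PHI_TRACES:  # Definition 6
        ABSTRACTIONS.append(f2)

print(len(ABSTRACTIONS))        # 1
print(ABSTRACTIONS[0][(0, 0)])  # 1, i.e. the surviving candidate is AB2
```

Only the candidate whose free row maps 00 to next state 1 for $g_2$ passes the inclusion test, matching the conclusion that $AB_2$ is the sole abstraction.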

It can be observed that exact refinements occur precisely when the translated MVN has a singleton 
set of candidate abstraction models, as shown by the following theorem. 

Theorem 15. Let $\phi$ be an abstraction mapping for some MVN $MV$. Then we know the following: 

(1) if $\phi(MV) = \{MV_A\}$ is a singleton set, then $MV_A =_\phi MV$; 

(2) if $\phi(MV)$ is not a singleton set, then no exact abstraction for $MV$ can exist under $\phi$. 

Proof. To prove (1), we observe that if $\phi(MV) = \{MV_A\}$ is a singleton set then for each $g_i \in MV$ 
the abstracted next-state function $\phi(f_{g_i})$ must be deterministic. This implies that all abstracted traces 
$\phi(\sigma)$, for $\sigma \in Tr(MV)$, must be valid. Furthermore, by Definition 12 and Theorem 13 it follows that 
$Tr(MV_A) = \phi(Tr(MV))$ as required. 

To prove (2), note that if $\phi(MV)$ contains more than one potential abstraction model then there must 
exist at least one abstracted next-state function $\phi(f_{g_i})$ which is non-deterministic. This implies there 
must exist at least one abstracted global state which leads to two or more different traces. Clearly, 
either some of these abstracted traces are invalid or $\phi(Tr(MV))$ must contain more traces than any single 
abstraction model could capture. Therefore, there cannot exist an exact abstraction for $MV$. □ 
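
Theorem 15 suggests a simple test for whether an exact abstraction can exist: every abstracted next-state table must have a single value per row. A minimal sketch, assuming a hypothetical dictionary representation of the tables and a hypothetical toy MVN (the name `is_deterministic_under` is not from the paper):

```python
from itertools import product

def is_deterministic_under(tables, phi):
    """Return True iff every abstracted next-state table has one value per row."""
    for i, table in enumerate(tables):
        seen = {}
        for state, nxt in table.items():
            abs_state = tuple(phi[j][v] for j, v in enumerate(state))
            abs_next = phi[i][nxt]
            # The first value recorded for a row must agree with all later ones.
            if seen.setdefault(abs_state, abs_next) != abs_next:
                return False
    return True

# Hypothetical toy MVN: g1 is Boolean, g2 has three states.
states = list(product([0, 1], [0, 1, 2]))
phi = [{0: 0, 1: 1}, {0: 0, 1: 1, 2: 1}]  # identity on g1, merge states 1 and 2 of g2
f_g1 = {s: int(s[1] >= 1) for s in states}
f_g2 = {(0, 0): 1, (0, 1): 1, (0, 2): 1, (1, 0): 0, (1, 1): 0, (1, 2): 2}

# Variant in which the rule for (1, 2) agrees, after abstraction, with the rule for (1, 1).
f_g2_alt = dict(f_g2)
f_g2_alt[(1, 2)] = 0

print(is_deterministic_under([f_g1, f_g2], phi))      # one row of g2 is non-deterministic
print(is_deterministic_under([f_g1, f_g2_alt], phi))  # singleton candidate set
```

When the function returns True the candidate set is a singleton, so by part (1) of the theorem its only member is an exact abstraction; when it returns False, part (2) rules out an exact abstraction under that mapping.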

5 Illustrative Biological Examples 

In this section we illustrate the theory and techniques developed in the previous sections by investigating 
the existence of abstractions for two published MVN models for the genetic regulatory network controlling 
the lysis-lysogeny switch in the bacteriophage λ [17, 15]. We begin with a brief introduction to the 
bacteriophage λ (see [14] for a more detailed introduction). 

The temperate bacteriophage λ is a virus which infects the bacterium Escherichia coli HUES]]. After 
infection of the host cell, a decision is made by λ based on environmental factors between two very 
different methods of reproduction, namely the lytic and lysogenic cycles [18]. In most cases, λ enters 
the lytic cycle, where it generates as many new viral particles as the host cell resources allow before 
producing an enzyme to lyse the cell wall, releasing the new phage into the environment. Alternatively, 
the λ DNA may integrate into the host DNA and enter the lysogenic cycle. Importantly, genes expressed 
in the λ DNA synthesize a repressor which blocks expression of other phage genes including those 
involved in its own excision. As such, the host cell establishes an immunity to external infection from 
other phages, and the phage λ is able to lie dormant, replicating with each subsequent cell division of 
the host. 



5.1 The Two Entity Core Regulatory Model 

A simple MVN model of the core regulatory mechanism for the lysis-lysogeny switch was proposed 
in [17]. This model, which we denote as PL2, is presented in Figure 8 and is based on the 
cross-regulation between two regulatory genes, CI (the repressor gene) and Cro. It can be seen that Cro 
inhibits the expression of CI and at higher levels of expression, also inhibits itself. The gene CI inhibits 
the expression of Cro while promoting its own expression. The full synchronous trace semantics $Tr(PL2)$ 
for this MVN is presented in Figure 8(c). We can see from the state transition graph in Figure 8(d) that 
PL2 has three attractor cycles, where the attractor cycle $10 \to 10$ represents the lysogenic cycle since the 
repressor gene CI is fully expressed and $01 \to 02 \to 01$ represents the lytic cycle. 
(b) State transition tables 
(d) Graphical representation of traces 

Figure 8: Formal definition and trace semantics for the MVN model PL2 of the core regulatory 
mechanism for the lysis-lysogeny switch in bacteriophage λ (taken from [17]). 

In order to identify an abstraction for PL2 we begin by selecting an appropriate state mapping 
$\phi(Cro) : \{0, 1, 2\} \to \{0, 1\}$ for the only non-Boolean entity Cro. We use our understanding of the 
behaviour of Cro to define the following state mapping 

$\phi(Cro) = \{0 \mapsto 0,\ 1 \mapsto 1,\ 2 \mapsto 1\}.$ 



We can then view $\phi(Cro)$ as an abstraction mapping and, following the approach in Section 4, we restrict 
the abstraction search space by applying the abstraction mapping $\phi(Cro)$ to PL2. This results in a set 
$\phi(Cro)(PL2)$ which contains two candidate abstraction models. It turns out that only one of these is a 
correct abstraction and we present this abstraction model APL2 in Figure 9. It is straightforward to check 
that the trace semantics of APL2 (see Figure 9) is indeed consistent with the abstracted trace semantics 
of PL2 (see Figure 10), i.e. $Tr(APL2) \subseteq \phi(Cro)(Tr(PL2))$. Thus, we know $APL2 \lhd_{\phi(Cro)} PL2$ holds. 
Figure 9: Abstraction model APL2 for PL2 and associated trace semantics $Tr(APL2)$. 

Figure 10: The traces $\phi(Cro)(Tr(PL2))$ resulting from abstracting the traces of PL2 using $\phi(Cro)$. 

It can be seen that the abstraction APL2 acts as a good approximation to the behaviour of the original 
MVN PL2 and in particular, we can see that the abstraction has captured all three attractor cycles that 
were present in PL2. 
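
A comparison of attractor cycles of this kind can be automated. The sketch below is illustrative only: since the transition tables of PL2 and APL2 are not reproduced in the text, it uses a hypothetical toy MVN and a hypothetical abstraction of it, and the function name `attractors` is an assumption.

```python
from itertools import product

def attractors(tables):
    """Return the attractor cycles of a deterministic synchronous model.

    Each cycle is a tuple of global states, rotated to start at its smallest state.
    """
    found = set()
    for state in tables[0]:
        position, path = {}, []
        while state not in position:
            position[state] = len(path)
            path.append(state)
            state = tuple(t[state] for t in tables)
        cycle = path[position[state]:]
        k = cycle.index(min(cycle))
        found.add(tuple(cycle[k:] + cycle[:k]))
    return found

# Hypothetical toy MVN (not PL2): g1 is Boolean, g2 has three states.
states = list(product([0, 1], [0, 1, 2]))
f_g1 = {s: int(s[1] >= 1) for s in states}
f_g2 = {(0, 0): 1, (0, 1): 1, (0, 2): 1, (1, 0): 0, (1, 1): 0, (1, 2): 2}
phi = [{0: 0, 1: 1}, {0: 0, 1: 1, 2: 1}]  # identity on g1, merge states 1 and 2 of g2

# Hypothetical Boolean abstraction of the toy MVN.
abs_states = list(product([0, 1], [0, 1]))
a_g1 = {s: s[1] for s in abs_states}
a_g2 = {(0, 0): 1, (0, 1): 1, (1, 0): 0, (1, 1): 0}

concrete = attractors([f_g1, f_g2])
abstract = attractors([a_g1, a_g2])
# State-by-state lifting is enough for this toy example; a general tool would also
# need to normalise cycles whose states collapse or reorder under the mapping.
lifted = {tuple(tuple(phi[i][v] for i, v in enumerate(s)) for s in c) for c in concrete}
print(abstract <= lifted)
```

Here every attractor cycle of the abstract model is the image of an attractor cycle of the original, while the original has one further attractor that the abstraction does not capture.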



5.2 The Four Entity Regulatory Model 

The core regulatory model presented above was extended in [17] to take account of the actions of two 
further regulatory genes, CII and N. The resulting four entity MVN model PL4 is presented in Figure 11 
(note that the state transition tables presented use a shorthand notation where an entity is allowed 
to be in any of the states listed for it in a particular row). 

Figure 11: An extended MVN model PL4 of the control mechanism for the lysis-lysogeny switch in 
bacteriophage λ (taken from [17]). 

This MVN is more detailed than PL2 and 
contains two entities with non-Boolean state spaces, namely CI with states $\{0, \dots, 2\}$ and Cro with states 
$\{0, \dots, 3\}$. The resulting state space for the model consists of 48 global states and for this reason we do 
not reproduce its trace semantics here. Instead, we simply note that PL4 has the following three attractor 
cycles (where the first corresponds to the lytic cycle and the remaining two to the lysogenic cycle) 

$0300 \to 0200 \to 0300, \quad 1000 \to 2100 \to 1000, \quad 2000 \to 2000$ 

We begin by looking to abstract the non-Boolean entities CI and Cro by defining appropriate state 
mappings. After considering the model, we define the following state mappings 

$\phi(CI) = \{0 \mapsto 0,\ 1 \mapsto 1,\ 2 \mapsto 1\}, \quad \phi(Cro) = \{0 \mapsto 0,\ 1 \mapsto 1,\ 2 \mapsto 1,\ 3 \mapsto 1\}$ 

which we use to define the abstraction mapping $\phi = (\phi(CI), \phi(Cro), I_{CII}, I_N)$. Again, following the 
approach presented in Section 4 we first apply this abstraction mapping to PL4 resulting in the set $\phi(PL4)$ 
of candidate abstraction models. By analysing $\phi(PL4)$ we are able to establish that there are 256 possible 
candidate abstraction models (we have 4 choices for CI, 4 choices for Cro, 8 choices for CII, and 2 
choices for N). After investigating these candidate models we were able to identify two abstractions for 
PL4 under $\phi$, denoted $APL4_1 \lhd_\phi PL4$ and $APL4_2 \lhd_\phi PL4$, which are presented in Figure 12. Interestingly, 
both abstractions appear to capture the key behaviour of PL4 in the sense that both contain the attractor 
cycles $0100 \to 0100$ and $1000 \to 1000$ which correspond to those present in PL4. 
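
The count of candidate models is the product of the per-entity choices quoted above:

\[
|\phi(PL4)| = 4 \times 4 \times 8 \times 2 = 256 .
\]

For comparison, and as an assumption not made in the text, a search that ignored $\phi(PL4)$ and allowed an arbitrary Boolean next-state function over the $2^4 = 16$ abstract global states for each of the four entities would have at most

\[
\left(2^{16}\right)^{4} = 2^{64}
\]

models to consider, which gives a sense of how much the abstraction mapping restricts the search space.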
Figure 12: The transition tables for the two abstractions $APL4_1$ and $APL4_2$ identified for PL4 under 
$\phi$, where all the transition tables are the same except for CII where $011 \to 1$ for abstraction $APL4_1$ but 
$011 \to 0$ for abstraction $APL4_2$. 



6 Conclusions 

In this paper we have developed an abstraction theory for MVN models based on the idea of using an 
abstraction mapping to relate the reduced state space of an abstraction to the original model. The problem 
of identifying suitable abstractions for an MVN was discussed and some initial ideas for restricting the 
number of candidate abstraction models that need to be considered were proposed. We showed that 
abstractions can be used to analyse an MVN since they preserve reachability properties and importantly, 
since all the attractor cycles of an abstraction will correspond to attractor cycles in the original model. 
This work was motivated by the need to be able to relate MVN models at different levels of abstraction 
and in particular, the idea of abstracting an MVN to a simpler model which is more amenable to analysis 
and visualization techniques. The abstraction theory presented can also be seen as providing a framework 
for an incremental refinement approach to constructing MVNs. 
We illustrated the abstraction theory and techniques developed by considering two examples based 
on published MVN models of the genetic regulatory network for the lysis-lysogeny switch in phage 
λ [17, 15]. We considered a simple two entity model and then an extended model that contained four 
entities (two of which were non-Boolean). In both cases we were able to identify meaningful Boolean 
abstractions which captured the key attractor cycles contained in the original models. 

Further work is now needed to build on the ideas presented in Section 4 to develop tool support 
for automatically checking and identifying abstractions. Initial ideas for such tool support have been 
presented in Q and work is ongoing to develop efficient algorithmic solutions to support the abstraction 
process. Other researchers have considered abstracting MVNs by reducing the number of regulatory 
entities while preserving important model dynamics (see for example [13, 20]). It would be interesting 
to consider combining such an approach with the abstraction theory we have developed here. Finally, we 
note that extending the abstraction theory to asynchronous MVN models is an interesting but challenging 
area of future work. In particular, ways of coping with the non-deterministic choices inherent in the 
dynamic behaviour of asynchronous models will be needed. 